An Early Performance Evaluation of the SiCortex SC648
نویسندگان
چکیده
We report the results of an early performance evaluation of the SiCortex SC648 that contains 648 processor-cores. Performance was measured using microbenchmarks to evaluate node and network performance, and with real-world parallel scientific applications. The SiCortex machines are notable for a number of architectural innovations, including the use of a custom, very low power, six-core processor; a state-of-the-art network interconnect; and the use of a degree-three (three input and three output channel per node) Kautz graph interconnect topology, together enabling high compute density, low power relative to potential compute capability, and fault-tolerant, relatively low contention communication. The purposes of this performance evaluation are two-fold: first to measure absolute performance of the SiCortex SC648 at both node level and with wellcharacterized parallel applications; and second, to obtain an indication of how the systems compare to nominally competing processors—current generation quad-cores from AMD and Intel—on scientific applications of interest.
منابع مشابه
Performance Analysis of the SiCortex SC072
The world of High Performance Computing (HPC) has seen a major shift towards commodity clusters in the last 10 years. A new company, SiCortex, has set out to break this trend. They have created what they claim to be a balanced cluster which makes use of low-power MIPS processors and a custom interconnect in an effort to avoid many of the bottlenecks plaguing most modern clusters. In this paper,...
متن کاملEarly Experiments with the OpenMP/MPI Hybrid Programming Model
The paper describes some very early experiments on new architectures that support the hybrid programming model. Our results are promising in that OpenMP threads interact with MPI as desired, allowing OpenMP-agnostic tools to be used. We explore three environments: a “typical” Linux cluster, a new large-scale machine from SiCortex, and the new IBM BG/P, which have quite different compilers and r...
متن کاملEnabling Loosely-Coupled Serial Job Execution on the IBM BlueGene/P Supercomputer and the SiCortex SC5832
Our work addresses the enabling of the execution of highly parallel computations composed of loosely coupled serial jobs with no modifications to the respective applications, on largescale systems. This approach allows new-and potentially far larger-classes of application to leverage systems such as the IBM Blue Gene/P supercomputer and similar emerging petascale architectures. We present here ...
متن کاملEvaluation of Autogenous Shrinkage in High-Performance Concrete
Recent tendencies in concrete technology have been towards to high- performance concrete with a low water-cement ratio. However, high performance concretes have some problems. One of the problems is early-age cracking due to autogenous shrinkage. This study presents the results of an experimental investigation carried out to evaluate the autogenous shrinkage of high-strength concrete. Accordi...
متن کاملLarge-Scale 3D Phase Field Dislocation Dynamics Simulations On High-Performance Architectures
In this paper we present the development and performance of a three-dimensional phase field dislocation dynamics (PFDD) model for large-scale dislocation-mediated plastic deformation on high-performance architectures. Through the parallelization of this algorithm, efficient run times can be achieved for large-scale simulations. The algorithm’s performance is analyzed over several computing plat...
متن کامل